Ultra-Efficient Content Addressable Memory for Tunable GPU Approximation
نویسندگان
چکیده
In this paper, we describe a resistive configurable associative memory (ReCAM) that enables selective approximation and asymmetric voltage overscaling to manage delivered efficiency. The ReCAM structure matches an input pattern with pre-stored ones by applying an approximate search on selected bit indices (bitline-configurable). To further reduce energy, we explore proper ReCAM sizing, various configurable search operations with low overhead voltage overscaling, and different ReCAM update policies. Experimental result on the AMD Southern Islands GPUs for eight applications shows bitlineconfigurable ReCAM achieve on average to 43.6% energy savings with an acceptable quality loss of 10%.
منابع مشابه
Resistive CAM Acceleration for Tunable Approximate Computing
The Internet of Things significantly increases the amount of data generated, straining the processing capability of current computing systems. Approximate computing is a promising solution to accelerate computation by trading off energy and accuracy. In this paper, we propose a resistive content addressable memory (CAM) accelerator, called RCA, which exploits data locality to have an approximat...
متن کامل4.5 Ultra-low Energy Memory Devices
Memory-based computing using associative memory has emerged as a promising solution to reduce the energy consumption of important classes of streaming applications such as multimedia by avoiding redundant computations. In associative memory, a set of frequent patterns that represent basic functions are pre-stored in ternary content addressable memory (TCAM) and reused. The primary limitation to...
متن کاملAssociative Memristive Memory for Approximate Computing in GPUs
Associative memory, in the form of lookup tables, is a promising approach to improving energy efficiency by enabling computing-with-memory. A processing element can be tightly coupled with an associative memory where function responses are pre-stored. Associative memories can recall function responses for a subset of input values therefore avoiding the actual function execution on the processin...
متن کاملA TCAM - Based Distributed Parallel IP Lookup Scheme and Performance Analysisi
Using Ternary Content Addressable Memory (TCAM) for high speed IP address lookup has been gaining popularity due to TCAM's deterministic performance and fast lookup performance. However, restricted by the slow improvement of memory accessing speed, the route lookup engines for next-generation terabit routers demand exploiting parallelism among multiple TCAM chips. Traditional parallel methods a...
متن کاملAn In-Storage Implementation of Smith-Waterman in Resistive CAM
An in-storage implementation of the Smith-Waterman sequence alignment algorithm on a resistive content addressable memory (ReCAM) based storage is proposed. The ReCAM native compare operation is used to find matching basepairs in a fixed number of cycles, regardless of the sequence length. Our in-storage implementation is simulated and compared to state-of-the-art systolic arrays and GPU-based ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017